OCR for Handwritten Kannada Language Script
نویسندگان
چکیده
The optical character recognition (OCR) is the process of converting textual scanned image into a computer editable format. The proposed OCR system is for complex handwritten Kannada characters. One of the major challenges faced by Kannada OCR system is recognition of handwritten text from an image. The input text image is subjected to preprocessing and then converted into binary image. Segmentation process is carried to extract single character from image. This can be done using connected component labeling. Hu’s invariant moments, horizontal and vertical profile features are obtained as features from zoned image. Probabilistic neural network (PNN) classifier is used for character recognition. Finally the recognized output is editable in baraha editor. An accuracy of 94.69% is achieved in character recognition of the domain specific input. Keywords— Optical Character Recognition (OCR), Feature Extraction, Neural Network, Kannada Scripts
منابع مشابه
Segmentation of Handwritten Documents Containing Kannada Script
Segmentation is one of the important phases of Optical Character Recognition (OCR) system, which extracts objects of interest from an image. Feature extraction and classification phases of OCR will be more effective, if the techniques selected for segmentation is effective. This paper focuses on to develop a system for handwritten documents containing Kannada script and proposes suitable techni...
متن کاملClassifier Fusion Method to Recognize Handwritten Kannada Numerals
Optical Character Recognition (OCR) is one of the important fields in image processing and pattern recognition domain. Handwritten character recognition has always been a challenging task. Only a little work can be traced towards the recognition of handwritten characters for the south Indian languages. Kannada is one such south Indian language which is also one of the official language of India...
متن کاملKannada Character Recognition System A Review
Intensive research has been done on optical character recognition ocr and a large number of articles have been published on this topic during the last few decades. Many commercial OCR systems are now available in the market, but most of these systems work for Roman, Chinese, Japanese and Arabic characters. There are no sufficient number of works on Indian language character recognition especial...
متن کاملA Script Recognizer Independent Bi-lingual Character Recognition System for Printed English and Kannada Documents
Department of Computer Science Amrita Vishwa Vidyapeetham, Mysore Campus Bogadi, Mysore INDIA _____________________________________________________________________________________ Abstract: Recognition of text document images is the inclination of any optical character recognition systems. This paper aims at extending the functionality of optical character recognition system to recognize more t...
متن کاملDiscrimination of English to other Indian languages (Kannada and Hindi) for OCR system
India is a multilingual multi-script country. In every state of India there are two languages one is state local language and the other is English. For example in Andhra Pradesh, a state in India, the document may contain text words in English and Telugu script. For Optical Character Recognition (OCR) of such a bilingual document, it is necessary to identify the script before feeding the text w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016